Dataset statistics
| Number of variables | 10 |
|---|---|
| Number of observations | 2935849 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 6 |
| Duplicate rows (%) | < 0.1% |
| Total size in memory | 224.0 MiB |
| Average record size in memory | 80.0 B |
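The memory figures above can be cross-checked with simple arithmetic. The sketch below uses only numbers copied from the table; the 8-bytes-per-cell figure is an assumption (it would hold for 64-bit numeric or pointer columns) that happens to agree with the 22.4 MiB reported for each variable further down.

```python
# Figures copied from the "Dataset statistics" table.
N_ROWS = 2_935_849
TOTAL_MIB = 224.0
DUPLICATE_ROWS = 6

MIB = 1024 ** 2  # bytes per mebibyte

# Average bytes per record = total memory / number of observations.
avg_record_bytes = TOTAL_MIB * MIB / N_ROWS

# Assumption: every cell occupies 8 bytes (e.g. int64, float64, or an object pointer).
per_column_mib = N_ROWS * 8 / MIB

# Share of duplicate rows, which the report rounds to "< 0.1%".
duplicate_share = DUPLICATE_ROWS / N_ROWS

print(f"average record size: {avg_record_bytes:.1f} B")
print(f"one 8-byte column:   {per_column_mib:.1f} MiB")
print(f"duplicate share:     {duplicate_share:.6%}")
```

Ten columns of 8 bytes each give the 80 B per record shown in the table.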
Variable types
| DateTime | 1 |
|---|---|
| Numeric | 6 |
| Text | 3 |
Alerts
| Alert | Type |
|---|---|
| Dataset has 6 (< 0.1%) duplicate rows | Duplicates |
| item_cnt_day is highly skewed (γ1 = 272.8331617) | Skewed |
| date_block_num has 115690 (3.9%) zeros | Zeros |
Reproduction
| Analysis started | 2024-04-08 13:35:38.398275 |
|---|---|
| Analysis finished | 2024-04-08 13:36:12.449031 |
| Duration | 34.05 seconds |
| Software version | ydata-profiling v4.7.0 |
date
Date
| Distinct | 1034 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 22.4 MiB |
| Minimum | 2013-01-01 00:00:00 |
|---|---|
| Maximum | 2015-12-10 00:00:00 |
Histogram with fixed size bins (bins=50)
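The sample rows later in this report show dates in a `dd.mm.yyyy` layout (for example `24.10.2015`), and the highest `date_block_num` rows fall in October 2015, yet the maximum reported here is 2015-12-10. One possible explanation, which is an inference and not something the report states, is that ambiguous strings were read month-first. The sketch below shows how a single hypothetical raw value yields two different dates depending on the format string.

```python
from datetime import datetime

# Hypothetical raw value in the dd.mm.yyyy layout seen in the sample rows.
raw = "12.10.2015"

# Explicit day-first format: 12 October 2015.
day_first = datetime.strptime(raw, "%d.%m.%Y")

# Month-first reading of the same string: 10 December 2015.
month_first = datetime.strptime(raw, "%m.%d.%Y")

print(day_first.date())
print(month_first.date())
```

Passing an explicit format when parsing the column removes the ambiguity.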
date_block_num
Real number (ℝ)
ZEROS 
| Distinct | 34 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 14.569911 |
| Minimum | 0 |
|---|---|
| Maximum | 33 |
| Zeros | 115690 |
| Zeros (%) | 3.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 7 |
| median | 14 |
| Q3 | 23 |
| 95-th percentile | 31 |
| Maximum | 33 |
| Range | 33 |
| Interquartile range (IQR) | 16 |
Descriptive statistics
| Standard deviation | 9.4229877 |
|---|---|
| Coefficient of variation (CV) | 0.64674296 |
| Kurtosis | -1.082869 |
| Mean | 14.569911 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 0.20385795 |
| Sum | 42775060 |
| Variance | 88.792697 |
| Monotonicity | Increasing |
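Several of the statistics above are derived from one another. As a quick consistency check, the sketch below recomputes them from the reported values for `date_block_num`; the only input not taken from these tables is the observation count, which comes from the dataset statistics.

```python
# Values copied from the date_block_num tables.
N_ROWS = 2_935_849
total = 42_775_060
std = 9.4229877
q1, q3 = 7, 23
minimum, maximum = 0, 33

mean = total / N_ROWS          # Mean = Sum / number of observations
cv = std / mean                # Coefficient of variation = std / mean
variance = std ** 2            # Variance = std squared
iqr = q3 - q1                  # Interquartile range = Q3 - Q1
value_range = maximum - minimum

print(f"mean={mean:.6f} cv={cv:.6f} variance={variance:.4f}")
print(f"iqr={iqr} range={value_range}")
```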
Histogram with fixed size bins (bins=34)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 11 | 143246 | 4.9% |
| 23 | 130786 | 4.5% |
| 2 | 121347 | 4.1% |
| 0 | 115690 | 3.9% |
| 1 | 108613 | 3.7% |
| 7 | 104772 | 3.6% |
| 6 | 100548 | 3.4% |
| 5 | 100403 | 3.4% |
| 12 | 99349 | 3.4% |
| 10 | 96736 | 3.3% |
| Other values (24) | 1814359 | 61.8% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 115690 | 3.9% |
| 1 | 108613 | 3.7% |
| 2 | 121347 | 4.1% |
| 3 | 94109 | 3.2% |
| 4 | 91759 | 3.1% |
| 5 | 100403 | 3.4% |
| 6 | 100548 | 3.4% |
| 7 | 104772 | 3.6% |
| 8 | 96137 | 3.3% |
| 9 | 94202 | 3.2% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 33 | 53514 | 1.8% |
| 32 | 50588 | 1.7% |
| 31 | 57029 | 1.9% |
| 30 | 55549 | 1.9% |
| 29 | 54617 | 1.9% |
| 28 | 54548 | 1.9% |
| 27 | 56274 | 1.9% |
| 26 | 69977 | 2.4% |
| 25 | 71808 | 2.4% |
| 24 | 88522 | 3.0% |
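The report does not define `date_block_num`. Judging from the sample rows (January 2013 maps to 0, October 2015 to 33), it appears to be a consecutive month index. A minimal sketch under that assumption follows; the function name `month_block` and its `start_year` parameter are illustrative, not part of the dataset.

```python
from datetime import date


def month_block(d: date, start_year: int = 2013) -> int:
    """Consecutive month index, with January of start_year mapped to 0."""
    return (d.year - start_year) * 12 + (d.month - 1)


# Dates taken from the sample and duplicate-row tables of this report.
print(month_block(date(2013, 1, 2)))    # first sample row, block 0
print(month_block(date(2014, 5, 1)))    # duplicate-row example, block 16
print(month_block(date(2015, 10, 24)))  # last rows, block 33
```

Under this reading, the 34 distinct values correspond to the 34 months from January 2013 through October 2015.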
shop_id
Real number (ℝ)
| Distinct | 60 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 33.001728 |
| Minimum | 0 |
|---|---|
| Maximum | 59 |
| Zeros | 9857 |
| Zeros (%) | 0.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 6 |
| Q1 | 22 |
| median | 31 |
| Q3 | 47 |
| 95-th percentile | 57 |
| Maximum | 59 |
| Range | 59 |
| Interquartile range (IQR) | 25 |
Descriptive statistics
| Standard deviation | 16.226973 |
|---|---|
| Coefficient of variation (CV) | 0.4917007 |
| Kurtosis | -1.0253581 |
| Mean | 33.001728 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | -0.072361429 |
| Sum | 96888091 |
| Variance | 263.31465 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 31 | 235636 | 8.0% |
| 25 | 186104 | 6.3% |
| 54 | 143480 | 4.9% |
| 28 | 142234 | 4.8% |
| 57 | 117428 | 4.0% |
| 42 | 109253 | 3.7% |
| 27 | 105366 | 3.6% |
| 6 | 82663 | 2.8% |
| 58 | 71441 | 2.4% |
| 56 | 69573 | 2.4% |
| Other values (50) | 1672671 | 57.0% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 9857 | 0.3% |
| 1 | 5678 | 0.2% |
| 2 | 25991 | 0.9% |
| 3 | 25532 | 0.9% |
| 4 | 38242 | 1.3% |
| 5 | 38179 | 1.3% |
| 6 | 82663 | 2.8% |
| 7 | 58076 | 2.0% |
| 8 | 3412 | 0.1% |
| 9 | 3751 | 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 59 | 42108 | 1.4% |
| 58 | 71441 | 2.4% |
| 57 | 117428 | 4.0% |
| 56 | 69573 | 2.4% |
| 55 | 34769 | 1.2% |
| 54 | 143480 | 4.9% |
| 53 | 52921 | 1.8% |
| 52 | 43502 | 1.5% |
| 51 | 44433 | 1.5% |
| 50 | 65173 | 2.2% |
item_id
Real number (ℝ)
| Distinct | 21807 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10197.227 |
| Minimum | 0 |
|---|---|
| Maximum | 22169 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1540 |
| Q1 | 4476 |
| median | 9343 |
| Q3 | 15684 |
| 95-th percentile | 20949 |
| Maximum | 22169 |
| Range | 22169 |
| Interquartile range (IQR) | 11208 |
Descriptive statistics
| Standard deviation | 6324.2974 |
|---|---|
| Coefficient of variation (CV) | 0.62019776 |
| Kurtosis | -1.22521 |
| Mean | 10197.227 |
| Median Absolute Deviation (MAD) | 5492 |
| Skewness | 0.25717355 |
| Sum | 2.9937519 × 10^10 |
| Variance | 39996737 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 20949 | 31340 | 1.1% |
| 5822 | 9408 | 0.3% |
| 17717 | 9067 | 0.3% |
| 2808 | 7479 | 0.3% |
| 4181 | 6853 | 0.2% |
| 7856 | 6602 | 0.2% |
| 3732 | 6475 | 0.2% |
| 2308 | 6320 | 0.2% |
| 4870 | 5811 | 0.2% |
| 3734 | 5805 | 0.2% |
| Other values (21797) | 2840689 | 96.8% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 1 | < 0.1% |
| 1 | 6 | < 0.1% |
| 2 | 2 | < 0.1% |
| 3 | 2 | < 0.1% |
| 4 | 1 | < 0.1% |
| 5 | 1 | < 0.1% |
| 6 | 1 | < 0.1% |
| 7 | 1 | < 0.1% |
| 8 | 2 | < 0.1% |
| 9 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 22169 | 1 | < 0.1% |
| 22168 | 6 | < 0.1% |
| 22167 | 1114 | < 0.1% |
| 22166 | 270 | < 0.1% |
| 22165 | 2 | < 0.1% |
| 22164 | 408 | < 0.1% |
| 22163 | 71 | < 0.1% |
| 22162 | 560 | < 0.1% |
| 22161 | 1 | < 0.1% |
| 22160 | 49 | < 0.1% |
item_price
Real number (ℝ)
| Distinct | 19993 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 890.85323 |
| Minimum | -1 |
|---|---|
| Maximum | 307980 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 1 |
| Negative (%) | < 0.1% |
| Memory size | 22.4 MiB |
Quantile statistics
| Minimum | -1 |
|---|---|
| 5-th percentile | 99 |
| Q1 | 249 |
| median | 399 |
| Q3 | 999 |
| 95-th percentile | 2690 |
| Maximum | 307980 |
| Range | 307981 |
| Interquartile range (IQR) | 750 |
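The quartiles above make it easy to see how far the extremes of `item_price` sit from the bulk of the data. The sketch below computes Tukey fences (Q1 − 1.5·IQR and Q3 + 1.5·IQR) from the reported quantiles. The 1.5 multiplier is a common convention rather than anything the report prescribes, and the fences are a screening aid, not a statement that values outside them are errors.

```python
# Quantiles copied from the item_price table.
q1, q3 = 249.0, 999.0
minimum, maximum = -1.0, 307980.0
p95 = 2690.0

iqr = q3 - q1
k = 1.5  # conventional Tukey multiplier (an assumption, not from the report)

lower_fence = q1 - k * iqr
upper_fence = q3 + k * iqr

print(f"fences: [{lower_fence}, {upper_fence}]")
print("maximum above upper fence:", maximum > upper_fence)
print("95th percentile above upper fence:", p95 > upper_fence)
print("minimum below lower fence:", minimum < lower_fence)
```

Note that the single negative price of -1 is not caught by the lower fence, so a domain rule such as "price must be positive" would have to be checked separately.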
Descriptive statistics
| Standard deviation | 1729.7996 |
|---|---|
| Coefficient of variation (CV) | 1.9417336 |
| Kurtosis | 445.53283 |
| Mean | 890.85323 |
| Median Absolute Deviation (MAD) | 250 |
| Skewness | 10.750423 |
| Sum | 2.6154106 × 10^9 |
| Variance | 2992206.8 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 299 | 291352 | 9.9% |
| 399 | 242603 | 8.3% |
| 149 | 218432 | 7.4% |
| 199 | 184044 | 6.3% |
| 349 | 101461 | 3.5% |
| 599 | 95673 | 3.3% |
| 999 | 82784 | 2.8% |
| 799 | 77882 | 2.7% |
| 249 | 77685 | 2.6% |
| 699 | 76493 | 2.6% |
| Other values (19983) | 1487440 | 50.7% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| -1 | 1 | < 0.1% |
| 0.07 | 2 | < 0.1% |
| 0.0875 | 1 | < 0.1% |
| 0.09 | 1 | < 0.1% |
| 0.1 | 2932 | 0.1% |
| 0.2 | 1 | < 0.1% |
| 0.5 | 1226 | < 0.1% |
| 0.9087136929 | 1 | < 0.1% |
| 0.99 | 493 | < 0.1% |
| 1 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 307980 | 1 | < 0.1% |
| 59200 | 1 | < 0.1% |
| 50999 | 1 | < 0.1% |
| 49782 | 1 | < 0.1% |
| 42990 | 4 | < 0.1% |
| 42000 | 1 | < 0.1% |
| 41990 | 3 | < 0.1% |
| 40991 | 1 | < 0.1% |
| 40900 | 1 | < 0.1% |
| 37991 | 2 | < 0.1% |
item_cnt_day
Real number (ℝ)
SKEWED 
| Distinct | 198 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.2426409 |
| Minimum | -22 |
|---|---|
| Maximum | 2169 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 7356 |
| Negative (%) | 0.3% |
| Memory size | 22.4 MiB |
Quantile statistics
| Minimum | -22 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 2169 |
| Range | 2191 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 2.6188344 |
|---|---|
| Coefficient of variation (CV) | 2.1074749 |
| Kurtosis | 177478.1 |
| Mean | 1.2426409 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 272.83316 |
| Sum | 3648206 |
| Variance | 6.8582938 |
| Monotonicity | Not monotonic |
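The skewness of 272.83 reported above is driven by a handful of very large values in a column where almost every entry is 1. To illustrate the effect, the sketch below implements the plain moment-based skewness g1 = m3 / m2^1.5 and applies it to toy data. The report's exact estimator may include a bias correction, so this is an illustration of the mechanism, not a reproduction of the reported number.

```python
def skewness(values):
    """Moment-based sample skewness g1 = m3 / m2**1.5 (no bias correction)."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n
    m3 = sum((v - mean) ** 3 for v in values) / n
    if m2 == 0:
        raise ValueError("skewness is undefined for constant data")
    return m3 / m2 ** 1.5


# Toy data, not the real column: symmetric values versus one large outlier.
symmetric = [1, 2, 3]
one_outlier = [1] * 999 + [2169]

print(skewness(symmetric))
print(skewness(one_outlier))
```

A single extreme value among otherwise identical entries is enough to push the statistic far above the range seen for the other variables in this report.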
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 2629372 | 89.6% |
| 2 | 194201 | 6.6% |
| 3 | 47350 | 1.6% |
| 4 | 19685 | 0.7% |
| 5 | 10474 | 0.4% |
| -1 | 7252 | 0.2% |
| 6 | 6338 | 0.2% |
| 7 | 4057 | 0.1% |
| 8 | 2903 | 0.1% |
| 9 | 2177 | 0.1% |
| Other values (188) | 12040 | 0.4% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| -22 | 1 | < 0.1% |
| -16 | 1 | < 0.1% |
| -9 | 1 | < 0.1% |
| -6 | 2 | < 0.1% |
| -5 | 4 | < 0.1% |
| -4 | 3 | < 0.1% |
| -3 | 14 | < 0.1% |
| -2 | 78 | < 0.1% |
| -1 | 7252 | 0.2% |
| 1 | 2629372 | 89.6% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 2169 | 1 | < 0.1% |
| 1000 | 1 | < 0.1% |
| 669 | 1 | < 0.1% |
| 637 | 1 | < 0.1% |
| 624 | 1 | < 0.1% |
| 539 | 1 | < 0.1% |
| 533 | 1 | < 0.1% |
| 512 | 1 | < 0.1% |
| 508 | 1 | < 0.1% |
| 504 | 1 | < 0.1% |
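The table of smallest `item_cnt_day` values lists nine negative values, and their counts can be tallied against the "Negative" figure of 7356 reported for this variable. The sketch below does that with the counts copied from the table. Reading negative counts as returned units is an assumption about the data's meaning, not something the report states.

```python
# Negative item_cnt_day values and their counts, copied from the table above.
negative_counts = {
    -22: 1, -16: 1, -9: 1, -6: 2, -5: 4,
    -4: 3, -3: 14, -2: 78, -1: 7252,
}
N_ROWS = 2_935_849

total_negative_rows = sum(negative_counts.values())
net_negative_units = sum(value * count for value, count in negative_counts.items())
negative_share = total_negative_rows / N_ROWS

print(total_negative_rows)
print(net_negative_units)
print(f"{negative_share:.1%}")
```

The tally matches the reported count, which indicates that the table covers every negative value in the column.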
shop_name
Text
| Distinct | 60 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 22.4 MiB |
Length
| Max length | 47 |
|---|---|
| Median length | 31 |
| Mean length | 22.517262 |
| Min length | 14 |
Characters and Unicode
| Total characters | 66107281 |
|---|---|
| Distinct characters | 73 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
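The tables in this section report the category, script, and block only as "(unknown)". As a small illustration of the character properties the sentence above refers to, the sketch below looks up the general category and name of a few characters that occur in the shop names, using Python's standard `unicodedata` module. The helper name `describe` is illustrative.

```python
import unicodedata


def describe(ch: str) -> tuple[str, str]:
    """Return the Unicode general category and official name of one character."""
    return unicodedata.category(ch), unicodedata.name(ch)


# Cyrillic capital Te, Cyrillic small A, quotation mark, space.
for ch in ["\u0422", "\u0430", '"', " "]:
    category, name = describe(ch)
    print(repr(ch), category, name)
```

The standard library exposes the general category (such as `Lu` or `Zs`) but not the script or block; those would need an external data source.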
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Ярославль ТЦ "Альтаир" |
|---|---|
| 2nd row | Москва ТРК "Атриум" |
| 3rd row | Москва ТРК "Атриум" |
| 4th row | Москва ТРК "Атриум" |
| 5th row | Москва ТРК "Атриум" |
Most occurring words
| Value | Count | Frequency (%) |
|---|---|---|
| тц | 1651630 | 15.9% |
| москва | 996636 | 9.6% |
| мега | 544689 | 5.3% |
| трц | 387410 | 3.7% |
| ii | 284579 | 2.7% |
| тк | 252032 | 2.4% |
| семеновский | 235636 | 2.3% |
| трк | 234360 | 2.3% |
| якутск | 204404 | 2.0% |
| атриум | 186104 | 1.8% |
| Other values (101) | 5383085 | 52.0% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 7424716 | 11.2% |
| " | 5170714 | 7.8% |
| а | 4306636 | 6.5% |
| о | 3616165 | 5.5% |
| к | 3128346 | 4.7% |
| е | 2993274 | 4.5% |
| Т | 2876617 | 4.4% |
| с | 2691262 | 4.1% |
| в | 2419562 | 3.7% |
| Ц | 2361816 | 3.6% |
| Other values (63) | 29118173 | 44.0% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 66107281 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 66107281 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 66107281 | 100.0% |
item_name
Text
| Distinct | 21807 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 22.4 MiB |
Length
| Max length | 150 |
|---|---|
| Median length | 104 |
| Mean length | 42.176862 |
| Min length | 2 |
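The mean length reported above can be recovered from two other figures in this report: the total character count for `item_name` and the number of observations. A short check, using only values copied from the report:

```python
# Figures copied from the report.
N_ROWS = 2_935_849             # Number of observations
TOTAL_CHARACTERS = 123_824_899  # Total characters in item_name
REPORTED_MEAN_LENGTH = 42.176862

# Mean length = total characters / number of rows (no values are missing).
mean_length = TOTAL_CHARACTERS / N_ROWS

print(f"{mean_length:.6f}")
```

The division only works this directly because the column has no missing values; otherwise the denominator would be the count of non-missing rows.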
Characters and Unicode
| Total characters | 123824899 |
|---|---|
| Distinct characters | 165 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 2371 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | ЯВЛЕНИЕ 2012 (BD) |
|---|---|
| 2nd row | DEEP PURPLE The House Of Blue Light LP |
| 3rd row | DEEP PURPLE The House Of Blue Light LP |
| 4th row | DEEP PURPLE Who Do You Think We Are LP |
| 5th row | DEEP PURPLE 30 Very Best Of 2CD (Фирм.) |
Most occurring words
| Value | Count | Frequency (%) |
|---|---|---|
| версия | 725653 | 4.0% |
| русская | 680070 | 3.7% |
| pc | 445501 | 2.4% |
| jewel | 316847 | 1.7% |
| ps3 | 230252 | 1.3% |
| bd | 216320 | 1.2% |
| 3 | 210744 | 1.2% |
| регион | 199715 | 1.1% |
| xbox | 188717 | 1.0% |
| 2 | 175655 | 1.0% |
| Other values (19552) | 14926755 | 81.5% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 16075953 | 13.0% |
| р | 3867036 | 3.1% |
| с | 3693825 | 3.0% |
| и | 3603497 | 2.9% |
| а | 3465014 | 2.8% |
| е | 3443034 | 2.8% |
| e | 3295768 | 2.7% |
| о | 2550540 | 2.1% |
| к | 2320910 | 1.9% |
| я | 2287092 | 1.8% |
| Other values (155) | 79222230 | 64.0% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 123824899 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 123824899 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 123824899 | 100.0% |
item_category_id
Real number (ℝ)
| Distinct | 84 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 40.001383 |
| Minimum | 0 |
|---|---|
| Maximum | 83 |
| Zeros | 3 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 19 |
| Q1 | 28 |
| median | 40 |
| Q3 | 55 |
| 95-th percentile | 71 |
| Maximum | 83 |
| Range | 83 |
| Interquartile range (IQR) | 27 |
Descriptive statistics
| Standard deviation | 17.100759 |
|---|---|
| Coefficient of variation (CV) | 0.42750418 |
| Kurtosis | -0.52515786 |
| Mean | 40.001383 |
| Median Absolute Deviation (MAD) | 15 |
| Skewness | 0.31828252 |
| Sum | 1.1743802 × 10^8 |
| Variance | 292.43594 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 40 | 564652 | 19.2% |
| 30 | 351591 | 12.0% |
| 55 | 339585 | 11.6% |
| 19 | 208219 | 7.1% |
| 37 | 192674 | 6.6% |
| 23 | 146789 | 5.0% |
| 28 | 121539 | 4.1% |
| 20 | 79058 | 2.7% |
| 63 | 53845 | 1.8% |
| 65 | 53227 | 1.8% |
| Other values (74) | 824670 | 28.1% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 3 | < 0.1% |
| 1 | 2 | < 0.1% |
| 2 | 18461 | 0.6% |
| 3 | 25283 | 0.9% |
| 4 | 2304 | 0.1% |
| 5 | 7231 | 0.2% |
| 6 | 18498 | 0.6% |
| 7 | 4459 | 0.2% |
| 8 | 1877 | 0.1% |
| 9 | 2193 | 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 83 | 7206 | 0.2% |
| 82 | 4390 | 0.1% |
| 81 | 795 | < 0.1% |
| 80 | 1325 | < 0.1% |
| 79 | 9067 | 0.3% |
| 78 | 2346 | 0.1% |
| 77 | 3703 | 0.1% |
| 76 | 3746 | 0.1% |
| 75 | 42603 | 1.5% |
| 74 | 56 | < 0.1% |
item_category_name
Text
| Distinct | 84 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 22.4 MiB |
Length
| Max length | 40 |
|---|---|
| Median length | 36 |
| Mean length | 20.538608 |
| Min length | 9 |
Characters and Unicode
| Total characters | 60298252 |
|---|---|
| Distinct characters | 93 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Кино - Blu-Ray |
|---|---|
| 2nd row | Музыка - Винил |
| 3rd row | Музыка - Винил |
| 4th row | Музыка - Винил |
| 5th row | Музыка - CD фирменного производства |
Most occurring words
| Value | Count | Frequency (%) |
|---|---|---|
| (blank) | 2905446 | 25.4% |
| игры | 1125431 | 9.8% |
| кино | 838291 | 7.3% |
| dvd | 564652 | 4.9% |
| pc | 506039 | 4.4% |
| издания | 486912 | 4.3% |
| музыка | 406737 | 3.6% |
| подарки | 370450 | 3.2% |
| стандартные | 351591 | 3.1% |
| cd | 347516 | 3.0% |
| Other values (90) | 3551360 | 31.0% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 8518576 | 14.1% |
| и | 3952343 | 6.6% |
| о | 3949536 | 6.6% |
| а | 3372489 | 5.6% |
| н | 3154622 | 5.2% |
| - | 3141462 | 5.2% |
| р | 2865626 | 4.8% |
| ы | 2672140 | 4.4% |
| г | 1843617 | 3.1% |
| д | 1758242 | 2.9% |
| Other values (83) | 25069599 | 41.6% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 60298252 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 60298252 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 60298252 | 100.0% |
Correlations
| | date_block_num | item_category_id | item_cnt_day | item_id | item_price | shop_id |
|---|---|---|---|---|---|---|
| date_block_num | 1.000 | 0.013 | 0.003 | 0.009 | 0.137 | 0.022 |
| item_category_id | 0.013 | 1.000 | -0.015 | 0.414 | -0.405 | 0.028 |
| item_cnt_day | 0.003 | -0.015 | 1.000 | -0.004 | 0.046 | -0.002 |
| item_id | 0.009 | 0.414 | -0.004 | 1.000 | -0.324 | 0.031 |
| item_price | 0.137 | -0.405 | 0.046 | -0.324 | 1.000 | -0.051 |
| shop_id | 0.022 | 0.028 | -0.002 | 0.031 | -0.051 | 1.000 |
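To read the matrix above at a glance, it can help to list the variable pairs ranked by absolute coefficient. The sketch below does this with the values copied from the table. The 0.3 cut-off is an arbitrary illustrative threshold, and the report does not say which correlation measure was used, so the ranking is descriptive only.

```python
labels = [
    "date_block_num", "item_category_id", "item_cnt_day",
    "item_id", "item_price", "shop_id",
]
# Rows and columns follow the order of `labels`; values copied from the table.
matrix = [
    [1.000, 0.013, 0.003, 0.009, 0.137, 0.022],
    [0.013, 1.000, -0.015, 0.414, -0.405, 0.028],
    [0.003, -0.015, 1.000, -0.004, 0.046, -0.002],
    [0.009, 0.414, -0.004, 1.000, -0.324, 0.031],
    [0.137, -0.405, 0.046, -0.324, 1.000, -0.051],
    [0.022, 0.028, -0.002, 0.031, -0.051, 1.000],
]

THRESHOLD = 0.3  # hypothetical cut-off for "worth a look"

# Collect each unordered pair once (upper triangle), then sort by |r| descending.
pairs = [
    (labels[i], labels[j], matrix[i][j])
    for i in range(len(labels))
    for j in range(i + 1, len(labels))
]
pairs.sort(key=lambda p: abs(p[2]), reverse=True)
strong = [p for p in pairs if abs(p[2]) >= THRESHOLD]

for a, b, r in strong:
    print(f"{a} vs {b}: {r:+.3f}")
```

Because `item_id` and `item_category_id` are identifiers, any coefficient involving them reflects how the IDs were assigned and should be interpreted with care.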
Missing values
A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly spot patterns in data completion.
First rows
| | date | date_block_num | shop_id | item_id | item_price | item_cnt_day | shop_name | item_name | item_category_id | item_category_name |
|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 02.01.2013 | 0 | 59 | 22154 | 999.00 | 1.0 | Ярославль ТЦ "Альтаир" | ЯВЛЕНИЕ 2012 (BD) | 37 | Кино - Blu-Ray |
| 1 | 03.01.2013 | 0 | 25 | 2552 | 899.00 | 1.0 | Москва ТРК "Атриум" | DEEP PURPLE The House Of Blue Light LP | 58 | Музыка - Винил |
| 2 | 05.01.2013 | 0 | 25 | 2552 | 899.00 | -1.0 | Москва ТРК "Атриум" | DEEP PURPLE The House Of Blue Light LP | 58 | Музыка - Винил |
| 3 | 06.01.2013 | 0 | 25 | 2554 | 1709.05 | 1.0 | Москва ТРК "Атриум" | DEEP PURPLE Who Do You Think We Are LP | 58 | Музыка - Винил |
| 4 | 15.01.2013 | 0 | 25 | 2555 | 1099.00 | 1.0 | Москва ТРК "Атриум" | DEEP PURPLE 30 Very Best Of 2CD (Фирм.) | 56 | Музыка - CD фирменного производства |
| 5 | 10.01.2013 | 0 | 25 | 2564 | 349.00 | 1.0 | Москва ТРК "Атриум" | DEEP PURPLE Perihelion: Live In Concert DVD (Кир.) | 59 | Музыка - Музыкальное видео |
| 6 | 02.01.2013 | 0 | 25 | 2565 | 549.00 | 1.0 | Москва ТРК "Атриум" | DEEP PURPLE Stormbringer (фирм.) | 56 | Музыка - CD фирменного производства |
| 7 | 04.01.2013 | 0 | 25 | 2572 | 239.00 | 1.0 | Москва ТРК "Атриум" | DEFTONES Koi No Yokan | 55 | Музыка - CD локального производства |
| 8 | 11.01.2013 | 0 | 25 | 2572 | 299.00 | 1.0 | Москва ТРК "Атриум" | DEFTONES Koi No Yokan | 55 | Музыка - CD локального производства |
| 9 | 03.01.2013 | 0 | 25 | 2573 | 299.00 | 3.0 | Москва ТРК "Атриум" | DEL REY LANA Born To Die | 55 | Музыка - CD локального производства |
Last rows
| | date | date_block_num | shop_id | item_id | item_price | item_cnt_day | shop_name | item_name | item_category_id | item_category_name |
|---|---|---|---|---|---|---|---|---|---|---|
| 2935839 | 24.10.2015 | 33 | 25 | 7315 | 399.0 | 1.0 | Москва ТРК "Атриум" | V/A Dance Kick! 2CD (digipack) | 55 | Музыка - CD локального производства |
| 2935840 | 31.10.2015 | 33 | 25 | 7409 | 299.0 | 1.0 | Москва ТРК "Атриум" | V/A Nu Jazz Selection (digipack) | 55 | Музыка - CD локального производства |
| 2935841 | 11.10.2015 | 33 | 25 | 7393 | 349.0 | 1.0 | Москва ТРК "Атриум" | V/A Lounge Del Mar 3 2CD (digipack) | 55 | Музыка - CD локального производства |
| 2935842 | 10.10.2015 | 33 | 25 | 7384 | 749.0 | 1.0 | Москва ТРК "Атриум" | V/A Ladies Sing The Blues 3CD | 55 | Музыка - CD локального производства |
| 2935843 | 09.10.2015 | 33 | 25 | 7409 | 299.0 | 1.0 | Москва ТРК "Атриум" | V/A Nu Jazz Selection (digipack) | 55 | Музыка - CD локального производства |
| 2935844 | 10.10.2015 | 33 | 25 | 7409 | 299.0 | 1.0 | Москва ТРК "Атриум" | V/A Nu Jazz Selection (digipack) | 55 | Музыка - CD локального производства |
| 2935845 | 09.10.2015 | 33 | 25 | 7460 | 299.0 | 1.0 | Москва ТРК "Атриум" | V/A The Golden Jazz Collection 1 2CD | 55 | Музыка - CD локального производства |
| 2935846 | 14.10.2015 | 33 | 25 | 7459 | 349.0 | 1.0 | Москва ТРК "Атриум" | V/A The Best Of The 3 Tenors | 55 | Музыка - CD локального производства |
| 2935847 | 22.10.2015 | 33 | 25 | 7440 | 299.0 | 1.0 | Москва ТРК "Атриум" | V/A Relax Collection Planet MP3 (mp3-CD) (jewel) | 57 | Музыка - MP3 |
| 2935848 | 03.10.2015 | 33 | 25 | 7460 | 299.0 | 1.0 | Москва ТРК "Атриум" | V/A The Golden Jazz Collection 1 2CD | 55 | Музыка - CD локального производства |
Duplicate rows
Most frequently occurring
| | date | date_block_num | shop_id | item_id | item_price | item_cnt_day | shop_name | item_name | item_category_id | item_category_name | # duplicates |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 01.05.2014 | 16 | 50 | 3423 | 999.0 | 1.0 | Тюмень ТЦ "Гудвин" | Far Cry 3 (Classics) [Xbox 360, русская версия] | 23 | Игры - XBOX 360 | 2 |
| 1 | 05.01.2013 | 0 | 54 | 20130 | 149.0 | 1.0 | Химки ТЦ "Мега" | УЧЕНИК ЧАРОДЕЯ (регион) | 40 | Кино - DVD | 2 |
| 2 | 12.07.2014 | 18 | 25 | 3423 | 999.0 | 1.0 | Москва ТРК "Атриум" | Far Cry 3 (Classics) [Xbox 360, русская версия] | 23 | Игры - XBOX 360 | 2 |
| 3 | 23.02.2014 | 13 | 50 | 3423 | 999.0 | 1.0 | Тюмень ТЦ "Гудвин" | Far Cry 3 (Classics) [Xbox 360, русская версия] | 23 | Игры - XBOX 360 | 2 |
| 4 | 23.03.2014 | 14 | 21 | 3423 | 999.0 | 1.0 | Москва МТРЦ "Афи Молл" | Far Cry 3 (Classics) [Xbox 360, русская версия] | 23 | Игры - XBOX 360 | 2 |
| 5 | 31.12.2014 | 23 | 42 | 21619 | 499.0 | 1.0 | СПб ТК "Невский Центр" | ЧЕЛОВЕК ДОЖДЯ (BD) | 37 | Кино - Blu-Ray | 2 |
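The table above lists six row groups that each occur twice. A minimal sketch of how such groups can be found with the standard library follows. The rows are abbreviated to the first six columns of two entries from the table plus one invented non-duplicate row (marked in the comments), so this illustrates the counting technique, not the report's own implementation.

```python
from collections import Counter

# (date, date_block_num, shop_id, item_id, item_price, item_cnt_day)
rows = [
    ("01.05.2014", 16, 50, 3423, 999.0, 1.0),
    ("01.05.2014", 16, 50, 3423, 999.0, 1.0),  # exact repeat of the row above
    ("05.01.2013", 0, 54, 20130, 149.0, 1.0),
    ("05.01.2013", 0, 54, 20130, 149.0, 1.0),  # exact repeat of the row above
    ("06.01.2013", 0, 54, 20130, 149.0, 1.0),  # invented row, occurs once
]

counts = Counter(rows)

# Groups that occur more than once, with their occurrence count.
duplicated = {row: n for row, n in counts.items() if n > 1}

# Rows that would be dropped if only the first copy of each group were kept.
surplus_rows = sum(n - 1 for n in duplicated.values())

print(len(duplicated), surplus_rows)
```

With six groups of two, keeping one copy of each group would drop six rows, which matches the duplicate count in the dataset statistics.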